TUKE at MediaEval 2013 Spoken Web Search Task

نویسندگان

Jozef Vavrek

Matús Pleva

Martin Lojka

Peter Viszlay

Eva Kiktová-Vozarikova

Daniel Hládek

Jozef Juhár

چکیده

This paper provides a rough description of zero resource Query-by-Example retrieving system for the MediaEval 2013 spoken web search task. The proposed solution firstly implements the voice activity detection (VAD) utilizing variance of acceleration MFCC (VAMFCC) rule-based approach. A PCA-based segmentation, K-means clustering and GMM training are then used in order to built the posteriorgrams. Finally, two searching architectures based on posteriorgram matching (SDTW) and GMM modeling (GMM-FST) are evaluated. Results show that none of our systems is able to achieve the positive Actual Term Weighted Value, because of high number of insertions. We suppose that chosen clustering scheme caused generation of too many false alarms. Only provided data were used and no other resources were examined in any system component during the development.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

ELiRF at MediaEval 2013: Spoken Web Search Task

In this paper, we present the systems that the Natural Language Engineering and Pattern Recognition group (ELiRF) has submitted to the MediaEval 2013 Spoken Web Search task. All of them are based on a Subsequence Dynamic Time Warping algorithm and are zero-resources systems.

متن کامل

TUKE MediaEval 2012: Spoken Web Search using DTW and Unsupervised SVM

This working paper provides the basic information about experiments conducted on audio documents within the MediaEval 2012 spoken web search evaluation project. The main purpose of these experiments was to build a robust and language independent system for spoken term detection. Therefore we have proposed query-by-example searching system based on the minimum-cost alignment of DTW algorithm and...

متن کامل

LIA @ MediaEval 2013 Spoken Web Search Task: An I-Vector based Approach

In this paper, we describe the LIA system proposed for the MediaEval 2013 Spoken Web Search task. This multilanguage task involves searching for an audio content query, in a database, with no training resources available. The participants must then find locations of each given query term within a large database of untranscribed audio files. For this task, we propose to build a language-independ...

متن کامل

TUKE at MediaEval 2015 QUESST

In this paper, we present our retrieving system for QUery by Example Search on Speech Task (QUESST), comprising the posteriorgram-based modeling approach along with the weighted fast sequential dynamic time warping algorithm (WFS-DTW). For this year, our main effort was directed toward developing language-dependent keyword matching system, utilizing all available information about spoken langua...

متن کامل

The Spoken Web Search Task

In this paper, we describe the “Spoken Web Search” Task, which is being held as part of the 2013 MediaEval campaign. The purpose of this task is to perform audio search in multiple languages and acoustic conditions, with very few resources being available for each individual language. This year the data contains audio from nine different languages and is much bigger in size than in previous yea...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2013

TUKE at MediaEval 2013 Spoken Web Search Task

نویسندگان

چکیده

منابع مشابه

ELiRF at MediaEval 2013: Spoken Web Search Task

TUKE MediaEval 2012: Spoken Web Search using DTW and Unsupervised SVM

LIA @ MediaEval 2013 Spoken Web Search Task: An I-Vector based Approach

TUKE at MediaEval 2015 QUESST

The Spoken Web Search Task

عنوان ژورنال:

اشتراک گذاری